Beyond bandlimited sampling of speech spectral envelope imposed by the harmonic structure of voiced sounds

نویسندگان

  • Hideki Kawahara
  • Masanori Morise
  • Tomoki Toda
  • Ryuichi Nisimura
  • Toshio Irino
چکیده

A new spectral envelope estimation procedure is proposed to recover details beyond band limitation imposed by the Shannon’s sampling theory when interpreting periodic excitation of voiced sounds as the sampling operation in the frequency domain. The proposed procedure is a hybrid of STRAIGHT, a F0-adaptive spectral envelope estimation and the auto regressive model parameter estimation. Wavelet analyses of these spectral models on the frequency domain enabled objective evaluation of this recovery procedure. The proposed procedure provides better speech quality especially when parameter manipulation is introduced.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Spectral envelope recovery beyond the nyquist limit for high-quality manipulation of speech sounds

A simple new method to recover details in a spectral envelope is proposed based on a recently introduced speech analysis, modification and resynthesis framework called TANDEMSTRAIGHT. Spectral envelope recovery of voiced sounds is a discrete-to-analog conversion in the frequency domain. However, there is a fundamental problem because the spatial frequency contents of vocal tract functions gener...

متن کامل

Estimating the spectral envelope of voiced speech using multi-frame analysis

This paper proposes a novel approach for estimating the spectral envelope of voiced speech independently of its harmonic structure. Because of the quasi-periodicity of voiced speech, its spectrum indicates harmonic structure and only has energy at frequencies corresponding to integral multiples of . It is hence impossible to identify transfer characteristics between the adjacent harmonics. In o...

متن کامل

Effect of voice quality on frequency-warped modeling of vowel spectra

The perceptual accuracy of an all-pole representation of the spectral envelope of voiced sounds may be enhanced by the use of frequency-scale warping prior to LP modeling. For the representation of harmonic amplitudes in the sinusoidal coding of voiced sounds, the effectiveness of frequency warping was shown to depend on the underlying signal spectral shape as determined by phoneme quality. In ...

متن کامل

Artificial Bandwidth Extension of Band Limited Speech Based on Vocal Tract Shape Estimation

This research addresses the challenge of improving degraded telephone narrowband speech quality caused by signal band limitation to the range of 0.3 3.4 kHz. We introduce a new speech bandwidth extension (BWE) algorithm which estimates and produces the high-band spectral components ranging from 3.4 kHz to 7 kHz, and emphasizes the lower spectral components around 300 Hz. Using a speech producti...

متن کامل

Influence of Differences between Inverse Filtering Techniques on the Residual Signal of Speech

Introduction Human speech production is characterized by two major processes: i) the source signal generation, which is either the quasi-periodic vibration of the vocal folds in voiced sounds, a turbulent airstream in voiceless sounds, or a combination of both in voiced fricatives, and ii) the slowly varying shape of the vocal tract causing a time-varying modulation of the spectral envelope of ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2013